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[2, 



, 3, CompI 



phrase 

SYN : 

HEAD 
CASE : 

Due to this last item, the next application of Ta t w adds to 73 the following item: 



nj 

head 

AGR : [agr] 
easel 



(14) 



[1. 



phrase 

SYN : 

HEAD 



SBCT 



head 

AGR : 



agr 

NUM : [sg] 



nelist 

1ST : [1] 
RST : [elist] 







phrase 

SYN : 

HEAD 
CASE : 



nj 

head 

AGR : [agr] 
accl 



,3, Act] (15) 



and since it unifies with the body of rule 3, completion can be applied and the following item is 
obtained in I4: 

phrase 

SYN : M 



[1. 



head 

AGR : 
SBCT : lelistl 



HEAD : [T] 



agr 

NUM : [sg] 



, 3, CompI 



(16) 



This item combines with item 13 to create in I5, by applying dot movement according to rule 1, 
the item: 



[0, 



phrase 

SYN : 



HEAD 



nj 

head 



AGR: {3} 



agr 

PERS 
NUM : 



3rd] 



phrase 

SYN : 

HEAD 
SBCT : 



head 

AGR: [3} 
elistl 



,3, Act] (17) 



CASE : 

which unifies with the body of rule 1 so that completion can generate 



[0, 



phrase 

SYN : 



SUBJ 



HEAD 



head 

AGR : {3} 
head 

AGR: [3} 



agr 

PERS 
NUM : 



3rd] 



, 3, CompI 



(18) 



This final item is more specific than the initial symbol of the grammar; it spans the entire 
input; hence the input "John loves fish" is a sentence of the grammar. 
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[1. 



word 




SYN : | 


v] 




head 


HEAD : 


AGR : 




nelist 


SBCT : 


1ST : 




RST : 



agr 

NUM : 

phrase 
_SYN : 
elist] 



,2, CompI 



(8) 



[2, 



word 

SYN : 

HEAD 
CASE : 



nj 

head 

AGR : 
easel 



[agrj 



, 3, CompI 



(9) 



If 7 = <j> then I\ contains only those items that are always added. I2 = Ta^wih) and hence 
dot movement (1) can be applied to the prediction items and the lexical items. By unifying the 
first element of rule 1 with the lexical entries of both "John" and "fish", two items are generated: 



[0, 



phrase 

SYN : 



HEAD 
CASE : 



nj 

head 

AGR : 



agr 

PERS 
NUM : 



3rd] 



1, Act] 



(10) 



[2, 



phrase 

SYN : 



[agrj 



HEAD 
CASE : 

Unification of the lexical entry of "loves" with rule 3 generates the following item: 



nj 

head 

AGR : 
noml 



,3, Act] 



(11) 



[1. 



word 




SYN : | 


v] 




head 


HEAD : 


AGR : 




nelist 


SBCT : 


1ST : 




RST : 



agr 

NUM 



g g 



phrase 
_SYN : [n] 
elist] 

Both lexical items for "John" and for "fish" can unify with the body of rule 2 and thus the 
completion (2) operation is applicable, and the following two items are generated: 



,2, Act] 



(12) 



[0, 



phrase 

SYN : 



HEAD 
CASE : 



nj 

head 

AGR : 



agr 

PERS 
NUM : 



3rd] 

sg] 



1, CompI 



(13) 
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Initial symbol: 



"phrase 

SYN : 

HEAD : 
CASE : 



phrase 

SYN : 



Rules 



head 

AGR : 







"word 

SYN : 
HEAD 

SECT : 



phrase 

SYN : 

HEAD : 
SECT : 







nelist 

1ST : 
RST : 







v| 

head 
AGR : 
elist J 

word 

SYN : 
HEAD 
CASE : 



phrase 

SYN : 
HEAD : 
CASE : 







" J 1 
head 

acc 



"phrase 

SYN : 
SUB J : 
HEAD : 

"phrase 

SYN : 
HEAD : 
_CASE : 

"phrase 

SYN : 
HEAD : 
SECT : 



John 



fish 



word 

SYN : 



HEAD : 



CASE 

word 

SYN : 



HEAD : 

SECT : 

word 

SYN : 

HEAD : 
CASE : 



"head 

AGR : 



head 
AGR : 
"nelist 

1ST : 
_RST : 

head 

AGR : 
case 



agr 

PERS : 
NUM : 



agr 

NUM : 



phrase 

SYN : 



elist 



agr 



3rd J 

sgl 



sg 



Figure 7: An example grammar 



scanning (5) operation: 



[0, 



word 

SYN : 

HEAD 
CASE : 



head 

AGR : 



agr 

PERS 
NUM : 



3rdJ 



1, CompI 
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coincide, namely that the computational process induced by the algebraic specification is correct 
with respect to the declarative specification. Finally, we formally characterized a subset of the 
grammars, off-line parsable ones, for which termination of parsing can be guaranteed. Making use 
of the well-foundedness of the subsumption relation, we proved that for every grammar in this 
class, parsing is finitely terminating. 
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A Examples 

An example of a TFS-based grammar is given in figures 6 and 7. The signature, containing the 
type hierarchy, the appropriateness specification and the start symbol are depicted in figure 6, 
whereas the rules and the lexicon are displayed in figure 7. 



Type hierarchy: 

jst 2nd 3rd nom acc 




Appropriateness: 



sign 




SYN : 


syn] 


HEAD : 


head 


CASE : 


case] 


SUB J : 


head 


SBCT : 


list] 



nelist 

1ST : 
RST : 



signj 
listl 



head 

AGR : 



agr 



agr 

PERS 
NUM : 



persj 
numl 



Figure 6: An example type hierarchy 

We simulate the process of parsing with the example grammar of figure 7 and the input "John 
loves fish" . 

Some items are added by every application of Ta t w'- the items [i,X,i, Act] for < i < 3 
are added due to the prediction (3) operation; the following three items are added due to the 
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Definition 4.26 (Strong off-line parsability) A grammar G is strongly off-line parsable iff 

there exists an FRG-function F from AMRSs to AMRSs such that for every string w and AMRSs 
A, B, tfA^A'e PT w (i + l,j) and BAB'e PT w (i + l,j) then F(A) ^ F(B). 

As [16] points out, "there are non-off-line-parsable grammars for which termination holds". We 
use below a more general notion of this restriction: we require that F distinguish A and B only if 
they are incomparable with respect to subsumption. 

Definition 4.27 (Weak off-line parsability) A grammar G is weakly off-line parsable iff 

there exists an FRG-function F from MRSs to MRSs such that for every string w and MRSs A, B , 
if A A A' £ PT w (i + l,j), B A B' £ PT w (i+ 1, j), A£B and B £ A, then F(A) ^ F(B). 

Theorem 4.28 // G is weakly off-line parsable and AMRSs are acyclic then every computation 
terminates. 

Proof: Fix a computation triggered by w of length n. We claim that there is only a finite number 
of generable items. Observe that the indices that determine the span of items are bounded: 
< i < j < n. It remains to show that only a finite number of AMRSs are generated. Let 
x = [i,A, j, c] be a generated item. Suppose another item is generated where only the AMRS is 
different: x' = [i, B, j, c] and A ^ B. If A C B, x' will not be preserved because of the subsumption 
test. There is only a finite number of AMRSs A' such that A' C A (since subsumption is well- 
founded for acyclic AMRSs). Now suppose A ^ B and B ^ A. By the parsing invariant (a) there 
exist A',B' such that A A A' £ PT w (i + 1, j) and B A B' £ PT w (i + 1, j). Since G is off-line 
parsable, F(A) ^ F(B). Since the range of F is finite, there are only finitely many items with 
equal span that are pairwise incomparable. Since only a finite number of items can be generated 
and the computation uses a finite number of operations, the least fix-point is reached within a 
finite number of steps. 

The proof relies on the well-foundedness of subsumption, and indeed the proposition doesn't 
hold for cyclic TFSs. If cyclic TFSs are allowed, the more strict notion of strong off-line parsability 
is needed. Under the strong condition the above proof is applicable for the case of non- well-founded 
subsumption as well. 

To exemplify the difference between strong and weak off-line parsability, consider a grammar 
G that contains the following rule R: 



f ml 



t 

/ i. 



Assume that 



t 

/ I- 



B £ PT w (i,j). This TFS is unifiable with the body (LHS) of R, and 

"t 



therefore the rule is applicable. By one application of R one gets 



/ 



t 



B £ PT w (i,j). 



This new TFS is also unifiable with the body of R, and thus an infinite number of TFSs can be 
shown to derive B. G, therefore, is not strongly off-line parsable. It is, however, weakly off-line 
parsable, since all the TFS that are created by successive applications of R form a subsumption 
chain. Thus termination is guaranteed for G by theorem 4.28. 



5 Conclusions 

We have formalized in this paper the concepts of grammars and languages for linguistic formalisms 
that are based on typed feature structures, using the notion of multi-rooted structures that gener- 
alize feature structures. We use multi-rooted structures for representing grammar rules as well as 
(the equivalent of) sentential forms that are generated during parsing. We described a computa- 
tional process that corresponds to parsing with respect to such formalisms. We thus achieved two 
different specifications, namely a declarative (derivation-based) one and an algebraic (computation- 
based) one, for the semantics of those formalisms. Next, we have proved that the two specifications 
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Items are generated by the dot movement (1) operation since the conditions for its application 
obtain: it is easy to see that the indices (i, j) match; in addition, if for some m, Ci... m C D x - x + m - 1 ; 
and for every k, Cj. C A k , then there exists Ci... m +i that is obtained by unifying some R □ Abs(p) 
first with Ci... m and then with C m+1 , such that Ci... m +i C £)a;...a;+m ag re q U i rec l Therefore, 
by induction on m it can be shown that all the items that result from dot movement are indeed 
generated. Finally, the completion (2) operation is applicable and (since A — ► D) we have C r \Z A y . 

Theorem 4.24 If w E L(G) then the computation triggered by w is successful. 

Proof: w G L(G), hence Abs(A s ) ~» B, where B G PT w (l,n). Hence there exist A' , B' such 
that A' U Abs(A s ) ^ T, B' □ 5 and A' 5'. By the parsing invariant, there exists / such that 
[0,C, n, Comp] G Ij where C C A'. Hence C U Abs(A s ) ^ T, and therefore the computation is 
successful. 

4.3.3 Subsumption check 

To assure efficient computation and eliminate redundant items, many parsing algorithms employ 
a mechanism called subsumption check (see, e.g., [16, 18]) to filter out certain generated items. We 
introduce this mechanism below and show that it doesn't effect the correctness of the computation. 

Define a (partial) order over items: A\ , j\, c{\ ■< \%2, A2, 32, C'j\ iff H = h,3i = 32, ci = c r z 
and A\ C i 2 - Modify the ordering on sets of items as follows: I\ ■< I2 iff for every x\ G I\ there 
exists X2 G I2 such that X2 ■< x\. Sets of items are no longer ordered by inclusion, but rather by 
a weaker condition that only requires the existence of a more general item (in the higher set) for 
every item (in the lower set). 

The subsumption filter is realized by modifying Ta t w'- x £ Tg iIB (/) only if there does not exist 
any item x' G Ta t w(I) such that x' ^ x. Namely, for all items that span the same substring and 
have the same status (Act or Comp), only the most general one is preserved across successive 
applications of Ta t w ■ Given the new ordering of sets of items, it can be shown that this modification 
does not harm neither monotonicity nor continuity, and hence every computation is guaranteed 
to reach a least fix-point. Obviously, the soundness of the computation is also maintained. More 
interestingly, completeness is preserved, too: recall that the parsing invariant (b) states that if 
A A A' □ B then for every k some item [ij., C'k,jk, Comp] is generated such that C'k Q A . Since 
the subsumption test only leaves out an item if a more general one exists, the invariant still holds 
and hence the correctness of the computation is guaranteed. Notice that if 1(G) would have been 
defined as the set of strings that are derivable from the start symbol itself, the subsumption check 
might have removed crucial items, and the computation could cease to be correct. 

4.3.4 Termination 

It is well-known (see, e.g., [13, 9]) that unification-based grammar formalisms are Turing-equivalent, 
and therefore decidability cannot be guaranteed in the general case. However, for grammars that 
satisfy a certain restriction, termination of the computation can be proven. We make use of the 
well-foundedness result to prove that parsing is terminating for off-line parsable grammars. 

Off-line parsability was introduced by [10] and adopted by [13], according to which "A grammar 
is off-line parsable if its context-free skeleton is not infinitely ambiguous". As [9] points out, this 
restriction (defined in slightly different terms) "ensures that the number of constituent structures 
that have a given string as their yield is bounded by a computable function of the length of that 
string". The problem with this definition is demonstrated by [7]: "Not every natural unification 
grammar has a context-free backbone". To overcome this problem, [7] uses a different restriction: 
"A grammar is depth-bounded if for every I > there is a D > such that every parse tree for a 
sentential form of I symbols has depth less than £)" . [16] generalizes it and we use an adaptation 
of his definition below. 

Definition 4.25 (Finite-range generalizing functions) A function F : D — ► D, where D is a 
partially- ordered domain, is finite-range generalizing (FRG) iff the range of F is finite and for 
every d G D, F(d) ^ d. 
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1. dot movement: x = [i a , C 1 jp } Act] where there exist a,j3 G if-i as required and C = 

(B, {k + 1}) U Ap, B = (i?, {1 . . .k}) U A a . By the induction hypothesis, there exist A' a , B a 
such that A a j4' a and A' a □ 5„ G PT(i a + 1, Also, there exist A'^,Bp such that 
A/, A A'p and A'p Z\ Bp £ PT(j a + l,jp). B x - k = (R,{1 . . . k}) U A a ; if k > 0, A« ^ A and 
by lemma 4.20 A« □ i?, hence = A«. If = 0, = A = A«. Hence A A' a . 

C 1 □ B 1Jc , and by lemma 4.7 there exists A" □ A' a such that C 1 A". In the same 
way, there exists A£ □ A' p such that C* &+1 A By lemma 4.8, C 1 - k+1 A A" • A" p . But 
^4" • A£ □ K -A'pUBc-Bp, and since B a G PT(i a + 1, j«) and Bp G PT(j a + l,jp), by 
lemma 4.2 5„ • 5^ G PT(i a + 1, j/3). The cases in which i a = j„ or = jp are trivial. 

2. completion: x = C m ,j a , Comp] where C = (i?, {1 . . . m — 1}) U j4„ and there exist an 

abstract rule i? and an item a G h-i as required, and (by lemma 4.20) A*"™ -1 □ i? 1 ••• m_1 . 
If *'« < ia then by the induction hypothesis, there exist A' a ,B a such that A a A' a and 
A' a □ B a G PT(i a + 1, C = (R,{l...m- 1}) U A«, hence C 1 -™- 1 = A a and thus 

C l...m-1 From l emma 4. 9; C*™ _^ c-l.-.m-^ and thug C m _^ ^ j f ^ = ^ then 

A a and hence C m A. 

3. prediction: x = [i, A, i, Act] and PT(i + 1, i) = <f>. 

4. e-rules: a; = [i, Abs(p), i, Comp] and PT(i + 1, i) = <f>. 

5. scanning: x = [i— 1, j4&s(j4 8 ), i, Comp] where Ai G Cat(wi), and j4&s(j4 8 ) j4&s(j4 8 ) trivially. 

A&s( J 4 i ) G PT(i+ l,j) by definition. 

Theorem 4.22 // a computation, triggered by w, is successful, then w G L(G). 

Proof: If a computation is successful then there exists some m > such that x = [0, A, n, Comp] G 
I m where len(A) = 1 and A U Abs(A s ) ^ T. By the parsing invariant, A A 1 for some A 1 □ B G 
PT„(l,n). Hence A6s(A s )~» 5 and w G i(G). 



4.3.2 Completeness 

The following theorem shows that one derivation step, licensed by a rule Abs(p) of length r, 
corresponds to r+l applications of Tg, w , starting with an item that predicts the rule and advancing 
the dot r times, until a complete item for that rule is generated. 

Theorem 4.23 (Parsing invariant (b)) If A A' and A' □ B G PT w (i + 1, j) then for every 
k, < k < len(A), there exists 4 such that [ik, Ck, jk, Comp] G Ij k , where Ck Q A k , i\ = i, 

Hen(A) = i and i k = ik + 1 l f k< l en ( A )- 

Proof: By induction on /, the number of derivation steps from A to A': 

If / = 0, A = A' □ B. Since B G PT w (i + l,j), B = Abs(A i+1 ) ■ . . . ■ Abs(Aj) where A k G Cat(w k ) 
for i + 1 < k < j. The scanning operation (5) of Tg, w adds appropriate items whenever it is 
applied. 

Assume that A — ► D — ► B □ PT w (i+l, j) and the proposition holds for D and B. By the induction 
hypothesis, for every k, < k < len(D), there exists 4 such that [ij., Ck, jk, Comp] G Ii k , where 
Ck Q A k . Suppose that A D through a rule p of length r by expanding A y to D x x + r ~ 1 . 
Then the following sequence of items is generated, where for every in , C\ m C D x "' x m , and 
C r \ZA y : 

by prediction (3) 
by the induction hypothesis 
by dot movement (1) 
by the induction hypothesis 
; 2 ) by dot movement (1) 

[i, Ci... r _i, j r _i, Act] G I ma x(h^^,i r . 1 ) by dot movement (1) 
[i, C r ,j, Comp] G I max (i 1 ,...,i r -i)+i by completion (2) 



'i, A, i, Act] 


G 


h 


>'i,Ci,ii,CoMP] 


G 




>',C*i,ii, Act] 


G 




>'2,C 2 ,i 2 ,COMP] 


G 




'i,Ci... 2 ,32, Act] 


G 
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and hence x G \Ji>o T G,w(Ii)- 

If a; 6 Uj>o Tg,w (Ii) then there exists some i that x G Ta t w(Ii)- h Q Ui>o ^ anc ^ smce Tg,w is 
monotone, Ta t w(Ii) Q ^G,u;(Ui>o ^«')> anc ^ nence * £ Tg iU) (Ui>o ^')- Therefore Tc )tl , is continuous. 

Corollary 4.16 TAe least fix-point of Ta, w can be obtained by iteratively computing i m +i = 
TG,w(Im), starting from I = <f> and stopping when a fix-point is reached. 

Proof: By Tarski-Knaster theorem, the lfp exists since Tg, w is monotone; By Kleene's theorem, 
since Tq w is continuous, the lfp can be obtained by applying the operator iteratively, starting from 
</,. 

Definition 4.17 (Algebraic meaning) The meaning of a grammar G with respect to an input 
sentence w is the least fix-point of the operator Tq , w ■ 

Definition 4.18 (Computation) The w - computation triggered by w G Words* is the infinite 
sequence of sets of items Ii,i > 0, such that I = <f> and for every m > 0, I m +\ = TG,w{Im)- 
The computation is terminating if there exists some m > for which I m = i m +i (i.e., a fix- 
point is reached in finite time). The computation is successful if there exists some m such that 
[0, A, n, Comp] G I m , where len(A) = 1 and A U Abs(A s ) ^ T; otherwise, the computation fails. 

Notice that we check whether the generated items are unifiable with the initial symbol, in ac- 
cordance with the definition of languages. If the initial symbol of the grammar is interpreted 
differently when languages are defined, a corresponding modification has to be made in the condi- 
tion for success. 

An example of parsing with respect to the example grammar given in figure 7 is given in the 
appendix. 

4.3 Proof of Correctness 

In this section we show that parsing, as defined above, is (partially) correct. First, the algorithm is 
sound: a w-computation succeeds only if w G L(G); second, it is complete: if w G L(G), it triggers 
a successful w-computation. Computations are not guaranteed to terminate, but we show that 
termination is assured for a certain subset of the grammars that are off-line parsable. We discuss 
off-line parsability in section 4.3.4. 

4.3.1 Soundness 

In what follows we fix a particular w-computation Io, I\, ■ ■ ., triggered by some input w = w\ ■ ■ ■ w n . 
Lemma 4.19 // [i, A, j, Comp] G Ii for some I then len(A) = 1. 

Proof: By definition o{Tg, w , complete items are generated by operations 2, 3 and 4. All these 
operations add items in which the AMRS is of length 1. 

Lemma 4.20 // [i,A, j, Act] G /; for some I and len(A) = k > then there exists p G 1Z such 
that Abs(p) 1 - k C A. 

Proof: By induction on /. If / = then I { = <f> and the proposition holds vacuously. Assume 
that the proposition holds for every /' < /. Suppose that x = [i,A, j, Act] G Ii and A ^ A. Then 
x must have been added by operation 1 (dot movement). Then x = [i a , C 1 j^, Act] where 
C = ((R, {1 . . .k}) U A a ), {k + 1}) U A p , namely C □ R and thus C 1 ' k+1 □ R 1 -^ 1 . 

Theorem 4.21 (Parsing invariant (a)) If[i,A,j,c] G Ii and i < j then there exist B G PT w (i+ 
l,j) and A' □ B such that A A A'. If i = j then A A A. 

Proof: By induction on /. 

If / = then 7; = <f> and the proposition holds vacuously. 

Assume that the proposition holds for every /' < /. Suppose that x = [i, A, j, c] G if. Then x must 
have been added by one of the operations. Consider each case separately: 
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I G Items, a; G T GiW (I) iff either 



3p G Tl,Abs(p) = R = A 1 ,. . ., A m _i => A m ,m > 1 
3k < m — 1 

3a 6 J,« = j4„, j a , Act], len(A a ) = 
3/3 el, /3= [i l 3,A l 3,j l 3,CoMP],len(A l3 ) = 1 
i« = ip 

B = (R,{1 . . . k}) U A a 
C = (B, {k + 1}) U A p 

x = [i a ,C 1 ' k+1 ,jp,AcT] 

3p G Tl,Abs(p) = R = A 1 ,. . ., A m _i => A m ,m > 1 
3a G /, a = [««, j4„, j a , Act], /en(j4 a ) = m — 1 
C* = (i?, {1 . . .m - 1})U4 

x = [i a ,c m ,i«,CoMP] 



(i) 



(2) 



3i, < i < n , 
x = [i, A, i, Act] ^ ' 

or 

3p G TZ, len(p) = 1 

3i, < i < n (4) 
x = [i, Abs(p), i, Comp] 

or 

w = w\, . . . , w n , n > 1 

3i, < i < n (5) 
x = [i — l,Abs(Ai), i, Comp], Ai G Cat(wi) 

Cases 1 and 2 perform the operation known as completion: 1 moves the dot one position along 
the body of a rule, and 2 creates a complete item once the dot reaches the end of the body. Case 3 
corresponds to the prediction operation, whereas case 5 corresponds to scanning. Case 4 handles 
e-rules, i.e., rules with null bodies, and creates complete items that span a null substring of the 
input sentence. Notice that cases 3 and 4 are independent of the argument I and therefore add 
the same items in every application of Ta t w Case 5 is also independent of the argument, but is 
dependent on the input sentence w. 

The operator Ta t w , on which the algebraic semantics of TFS-based grammars is based, naturally 
induces an operational semantics for such formalisms: once the operator is shown to be continuous, 
a computational process that corresponds to the iterative application of Ta t w computes the set 
of items in the least fix-point of the operator. This process can be thought of as an analog of 
bottom-up, chart-based parsing: the chart is initialized with predictions for every rule in every 
position (by operation 3) and with complete edges for every input word (by operation 5). Then, 
operations 1 and 2 are used to apply the grammar rules using the chart, in an unspecified order. 
We prove below that the process is indeed analogous to parsing w with respect to G. 

Theorem 4.14 Ta t w ts monotone: if I\ C I2 then Ta^wih) Q ?G,m(A)- 

Proof: Suppose 7i C 7 2 - If i G Ta^wih) then x was added by one of the five operations; 
operations 3, 4 and 5 add the same items every time Ta t w is applied, and thus x G Ta^wih), too. 
If x was added by operation 1, then there exist items a, [3 in I\ to which this operation applies. 
Since I\ C I2, a, [3 are in I2, too, and hence x G Ta^wih), too. The same applies for operation 2. 
In any case, x G Ta^wih) and hence Ta^wih) Q Ta^wih)- 

Theorem 4.15 Ta t w ts continuous: if f,i > is a chain, then ?G,u;(Ui = Ui^G,w(^)- 

Proof: First, Ta t w is monotone. Second, let I = 7o Q h Q ... be a chain of items. If a; £ 
Tg,w (Ui>o then there exist a, [3 G Ui>o^' as required, due to which x is added. Then there 
exist i,j such that a G 1% and [3 G Ij ■ Let k be the maximum of i, j. Then a, [3 G Ik, % G Ta^wih) 
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Figure 5: A leftmost derivation 



Definition 4.12 (Items) An item is a four-tuple [i, A, j, c], where i, j £ IN , i < j, A is an AMRS 
and c is either Act, in which case the item is active, or Comp, in which case it is complete. 
Let Items be the collection of all items. 

If [i, A, j, c] is an item, we say that A spans the input from position i + 1 to position j (inclusive). 
A can be seen as a representation of a dotted rule, or edge: during parsing all generated items 
are such that A is (possibly more specific than) a prefix of some grammar rule. The notion of 
items usually employs edges that contain entire rules, whereas we only use prefixes of rules. This 
difference is not essential, and in an actual implementation of a parser that is induced by Tq, w , 
edges indeed include a reference to the rule on which they rely. 

In what follows we defined Tq, w , a parsing operator that corresponds (bottom-up) chart parsing. 
However, it is possible to characterize algebraic operators that correspond to other parsing schemas 
as well. 

Definition 4.13 Let Tq, w '■ 2^ TEMS — ► 2^ TEMS be a transformation on sets of items, where for 
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all sentences. Moreover, it makes sense to include in the language such strings that are not 
derived directly by the start symbol, but rather by a TFS that is related to the start symbol. 
For example, the grammar writer might state that only TFSs with a cat feature valued S are 
permissible, meaning that every TFS that is subsumed by the start symbol (that is, contains all 
the information it encodes) is a sentence. However, such a definition prevents the incorporation of 
subsumption test (see section 4.3.3 below) into the parsing, since the correctness of the computation 
can not be maintained. 

Due to these consideration we chose a relaxed condition on the start symbol in our definition of 
languages. We define a derivation relation between AMRSs in a way that allows the initial symbol of 
the grammar to derive a sequence of lexical entries even if the actual (strong) derivation is between 
a TFS that unifies with the start symbol and a more specific instance of the pre-terminals. 

Definition 4.10 (Derivation) An AMRS A derives an AMRS B (A ~» B) iff there exist 
AMRSs A', B' such that (A, {1, . . . , len(A)}) U A 1 ^ T, B C B' and A' A B' . 

Definition 4.11 (Language) The language of a grammar G is L(G) = {w = wi---w n £ 

Words* | Abs(A s ) ~» B for some B £ PT w (l,n)}. 

Figure 5 shows a sequence of derivation steps, starting from some feature structure that is more 
specific than the initial symbol and ending in a sequence of structures that can stand for the string 
"John loves fish", based upon the example grammar. While we use identical tags within different 
MRSs it must be understood that the scope of tags is limited to a single AMRS. When two AMRSs 
are related by derivation, there can be no reentrancies between them. The use of identical tags 
here is for explanatory reasons only. 

4.2 Parsing as Operational Semantics 

We view parsing as a computational process, capable of endowing TFS formalisms with an op- 
erational semantics, which can be used to derive control mechanisms for an abstract machine we 
design (see [19]). 

As is well known (see, e.g., [11]), the meaning of a logic program P can be specified algebraically 
as the least fix-point (lfp) of the immediate consequence operator Tp of the program. A similar 
approach can be applied to a context-free grammar G, such that L{G) equals (a projection of) 
the least fix-point of an analogous immediate derivation operator, Tq. Let G = (V, T, P, S) be 
a context-free grammar. 5 Let I C V x T* . Define Tq(I) = {(A,w) | A — ► w £ P, w £ T} U 
{(A, w\ ■ ■ -Wk) | A — ► Ai ■ ■ ■ Ak £ P, (Ai, Wi) £ I, 1 < i < k}. Then the least fix-point of Tq is the 
union over A £ V of {{A, w) \ w £ L A (G)}. 

In a sense, computing the lfp of Tq corresponds to computing the language generated by G. 
Parsing, then, amounts to checking if the input w is in the language. This process induces an 
inherently inefficient computation: since w is given, it can be used to optimize the computation. 
This is achieved by defining Ta t w , a parsing step operator, which is dependent on the input sentence 
w. The set of items I has to be extended, too: an item is a triple [i, A, j] where < i, j, < n (n being 
the length of w) and A £ V . Informally, an item [i, A, j] represents the existence of a derivation for 
the symbol A to a substring of w, namely . . .Wj . w £ Ls(G) if and only if [1, S, n] £ lfp(Ta tW ), 
so that parsing now amounts to computing the least fixed point of Ta t w, which is more efficient, 
and then checking whether the appropriate item is in the lfp. 

We now return to TFS-based formalisms and define Ta t w f° r a TFS-based grammar G, thus 
providing means for defining the meaning of G. A computation is triggered by some input string 
of words w = w\ ■ ■ ■ w n of length n > 0. For the following discussion we fix a particular grammar 
G = (1Z,A S ) and a particular input string w of length n. A state of the computation is a set of 
items, and states are related by a transition relation. The presentation below corresponds to a 
pure bottom-up parsing algorithm, as it is both simple and efficient. 

5 We assume a normal form, where for A a G P, either a G T or a G V* . 
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Figure 4: Strong derivation 



Proof: A — ► B, therefore there exists a rule p £ 1Z, an AMRS i? □ Abs(p) and an index j such 
that A unifies with the head of R, and B is obtained by replacing A 3 with the body of R. A and B 
are already "as specific as needed" ; thus, since A Q A' and A = (A, {j}) U R n , A' = (A' , {j}) U R n . 
Hence there exists i?' □ i? such that R' = (R',{n}) U A' 3 , A' unifies with its head and B' is 
obtained by replacing the j-th element of A' with its body. 

Lemma 4.7 If A —>■ B and A C A' then there exists B' such that B C B' and A' —> B' . 
Proof: By induction on the derivation sequence and lemma 4.6. 
Lemma 4.8 If A 1 - 1 * A B and A k+1 A C then A 1 ' k+1 A B -C. 

Proof: The derivation is obtained by applying first the derivation steps that derive B from A 1 * ^ 
and then those that derive C from A + . Since 

A i...k ^ B; A 

is "as specific as needed" and the 
application of the derivation steps from A to B does not affect the applicability of the derivations 
step to C . 

Lemma 4.9 If A □ Abs(p) for some p £ TZ of length n then A" — ► A 1 '™ -1 . 

Proof: Immediate from the definition of derivation. 

There are various definitions in the literature for the language that is defined by a grammar G 
expressed in a unification-based grammar formalism. For example, [15, 16] do not include a start 
symbol in the grammar at all, and define 1(G) as the set of strings derivable from some feature 
structure. In [17] a start symbol is defined (notated goal axiom), and 1(G) is defined as the set 
of strings that are derivable from some generalization of the start symbol, i.e., from some feature 
structure that subsumes it. [18], on the other hand, assumes that a specific feature cat is present in 
every feature structure (the value of which simulates non-terminals in a context-free "underlying" 
grammar), and uses this feature to single out the start symbol: 1(G) is the set of strings that 
are derivable from some feature structure in which the cat feature is S (the start symbol of the 
underlying context-free grammar). A similar definition is given by [7]: 1(G) is the set of strings 
derivable from the start symbol, where the start symbol is a constant (that is, an atomic feature 
structure). 

There is a good motivation to employ a start symbol: the grammar writer might want to 
specify a certain criterion for the permissible strings in the language, for example, that they are 
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in them in order for them to be related by derivation. This is why, in the definition below, we use 
an AMRS R that is at least as specific as some rule p, and not p itself, to guide the derivation. 
This is also why the definition requires that all the unifications do not add information, strong 
derivation is the relation that holds between such AMRSs; another relation, derivation, relaxes that 
requirement by allowing two AMRSs to be related even if they contain only part of the information 
that is required for strong derivation to hold. 

Since elements of AMRSs involve indices that denote their linear position in the sequence of 
roots, the operation of replacing some element in one AMRS with a sub-structure, whose length 
might be greater than one, becomes notationally complicated. Conceptually, though, it resembles 
very much the replacement of some symbol with a sequence of symbols in context-free derivation, 
or the replacement of the selected goal (that unifies with the head of some rule) with the body 
of the rule, in Prolog SLD-resolution. One main difference in our definition is that we do not 
carry substitutions through sequences of derivations; rather, we treat all the pairs in a derivation 
sequence as if the appropriate substitutions have already been applied to them (recall that members 
of these pairs are "as specific as needed"). 

Definition 4.5 (Strong Derivation) An AMRS A = (Ind,A, n^, 0^, pa^) of length k strongly 
derives an AMRS B (denoted A^ B) iff 

• there exist a rule p £ 1Z and an AMRS R □ Abs(p) (with len(R) = n), such that: 

• some element of A unifies with the head of R, and some sub-structure of B unifies with the 
body of R; namely, there exist j £ Ind A and i £ Inds such that: 

A= (A,{j})UR n , B LJ+n - 2 = (B,{i...i + n-2})UR 1 - n - 1 , 
R= (R, {n})UA 3 , R = (R,{l...n-l})U B LA+n ~ 2 

• B is the replacement of the j-th element of A with the body of R; namely, let 

/(0 = {; + n _ 2 !/}<!<*' 9(i) = i + J-Hn<i<» 

then B = Ty(Eq(Cl{{IndB' , , ©5/ , ~_b<}))), where 

- Ind B > = (1, • • • , k + n - 2} 

i = f(i') and (i' , ir) £ 11^ or 



(i,7r)eu BI iff ^ . = g{ . r) and 

l 0i?(« , tt) if i = g(i ) 

- (ii, 7Tl) Ki B , (i 2 , 7T 2 ) if 

* h = f(A) and *2 = /(*•>) an d , tti) Ki A {i' 2 ,TT 2 ), or 

* h = 9(A) and *2 = 9(12) and (i[,TTi) & R (i' 2 ,Tr 2 ), or 

* i 1 = f{i'i) and i 2 = g{i 2 ) and there exist 1:1,1:2,1:3 such that (i^ , i:\ ) k,a 0'; 13) an d 
(n,7r 3 ) K, R (i' 2 ,7r 2 ), or 

* i l = g{i'i) and i 2 = f(i' 2 ) and there exist 1:1,1:2,1:3 such that (i^ , 7Ti) &r (j, 1:3) and 

(n,7r 3 ) K, A (i' 2 ,TT 2 ) 

The reflexive transitive closure of » denoted is defined as follows: A A" if A = A" or 

if there exists A' such that A — ► A' and A' — ► A". 

Intuitively, A strongly derives B through some AFS A 3 in A, if some rule p £ 1Z licenses the 
derivation. A J is unified with the head of the rule, and if the unification succeeds, the (possibly 
modified) body of the rule replaces A 3 in B. The definition is graphically demonstrated in figure 4. 

Lemma 4.6 If A —>■ B and A C. A' then there exists B' such that B C B' and A' —>■ B' . 
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notion of AMRSs. This presentation is more adequate to current TFS-based systems than [7, 17], 
that use first-order terms. Moreover, it does not necessitate special, ad-hoc features and types for 
encoding trees in TFSs as [16] does. We don't assume any explicit context-free back-bone for the 
grammars, as do [10] or [18]. 

The parsing algorithm we describe is a pure bottom-up one that makes use of a chart to record 
edges. The formalism we presented is aimed at being a platform for specifying grammars in HPSG, 
which is characterized by employing a few very general rules (or rule schemata); selecting the rules 
that are applicable in every step of the process requires unification anyhow. Therefore we choose 
a particular parsing algorithm that does not make use of top down predictions but rather assumes 
that every rule might be applied in every step. This assumption is realized by initializing the chart 
with predictive edges for every rule, in every position. 

4.1 Rules and Grammars 

We define rules and grammars over a fixed set Words of words (in addition to the fixed sets 
Feats and Types). We use w to refer to elements of Words, Wi to refer to strings over Words. 
We assume that the lexicon associates with every word a set of feature structures Cat(wi), its 
category, 1 so we can ignore the terminal words and consider only their categories. The input for 
the parser, therefore, is a sequence 2 of sets of TFSs rather than a string of words. 

Definition 4.1 (Pre-terminals) Let w = wi...w n £ Words*. PT w (j,k) is defined iff 1 < 
j,k < n, in which case it is the set of AMRSs Abs((Aj , Aj + i, . . . , A^)) where Ai £ Cat(wi) for 
j < i < k. If j > k then PT w (j,k) = {A}. We omit the subscript w when it is clear from the 
context. 

Lemma 4.2 If w = w\- ■ - w n , 1 < i < j < k < n, A £ PT w (i,j) and B £ PT w (j + l,k) then 
A- Be PT w (i,k). 

Proof: An immediate corollary of the definition. 

Definition 4.3 (Rules) A rule is a MRS of length n > with a distinguished last element. If 
(Ai, . . . , A n -i, A n ) is a rule then A n is its head 3 and {A\, . . . , A n -\) is its body. 4 We write such 
a rule as (Ai , . . . , A n _\ =>• A n ) . In addition, every category of a lexical item is a rule (with an 
empty body). We assume that such categories don't head any other rule. 

Notice that the definition supports e-rules, i.e., rules with null bodies. 

Definition 4.4 (Grammars) A grammar G = (1Z,A S ) is a finite set of rules 1Z and a start 
symbol A s that is a TFS. 

An example grammar, whose purpose is purely illustrative, is depicted in figures 6 and 7 in the 
appendix. A discussion of the methodological status of the start symbol appears later on in this 
section, prior to the definition of languages. 

For the following discussion we fix a particular grammar G = (1Z,A S ). We define a derivation 
relation over AMRSs as the basis for defining the language of TFS-based grammars. Checking 
whether two given AMRSs A and B stand in the derivation relation is accomplished by the following 
steps: first, an element of A has to be selected; this element has to unify with the head of some 
rule p; then, a sub-structure of B is selected; this substructure has to unify with the body of p. All 
unifications are done in context, so that other components of the AMRSs involved may be affected, 
too. Moreover, there must be some way to record the effects of successive unifications; to this end, 
derivation is defined only for pairs of AMRSs that are already "as specific as needed" ; that is to 
say, if the rule adds any information to the AMRSs, this information already has to be recorded 

1 C at{wi) is a singleton if w t is unambiguous. 

2 We assume that there is no reentrancy among lexical items. 

3 This use of head must not be confused with the linguistic one, the core features of a phrase. 

4 Notice that the traditional direction is reversed and that the head and the body need not be disjoint. 
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A sub-structure of A is obtained by selecting a subsequence of the indices of A and considering 
the structure they induce. Trivially, this structure is an AMRS. We use A J " k to refer to the sub- 
structure of A induced by {j, . . . , k}. If Ind B = {«}, A l A can be identified with an AFS, denoted 
A i . 

The notion of concatenation has to be defined for AMRSs, too. Notice that by definition, 
concatenated AMRSs cannot share elements between them. 

Definition 3.5 (Concatenation) The concatenation of A = (Ind A , 11^, 0^, ta A ) and B = 
(Ind B , lis, B , ~b) of lengths n A ,n B , respectively (denoted by A ■ B), is an AMRS C = 
(Indc, lie, ©c, ~c) such that 

• Ind c = {1, • • • , n A + n B } 



• k, c = tx A U{((ii + n A ,TTi), (i 2 + n A ,ir 2 )) \ (h, fi) ~s (h,^)} 

As usual, A ■ X = X ■ A = A. 

We now extend the definition of unification to AMRSs: we want to allow the unification of two 
AM RSs, according to a specified set of indices. Therefore, one operand is a pair consisting of an 
AMRS and a set of indices, specifying some elements of it. The second operand is either an AMRS 
or an AFS, considered as an AMRS of length 1. Recall that due to reentrancies, other elements of 
the first AMRS can be affected by this operation. Therefore, the result of the unification is a new 
AMRS. We refer to AMRS unification as "unification in context" in the sequel to emphasize the 
effect that the operation might have on elements that are not directly involved in it. 

Definition 3.6 (Unification of AMRSs) Let A = (Ind A ,Tl A , a ,~a) be an AMRS. Let B = 
(Ind B , lis, B , ~b) be an AMRS (if B is an AFS it is interpreted as an AMRS of length 1). Let 
J be a set of indices such that J C Ind A . Let f(i) = i if B is an AMRS, f(i) = 1 if B is an AFS. 
(A, J) U B is defined if B is an AMRS and J C Ind B , or if B is an AFS and \ J\ = I; in any case, 
it is the AMRS C = Ty(Eq(Cl((Indc ,H C ,O c ,«c)))), where 

• Indc = Lnd A 

• n c = U A U {(i, tt) I i £ J and (f(i), tt) £ U B 



• & C = ~A U{((ii, 7Ti), (i 2 , 7r 2 )) | h,i 2 G J and tti) «b (Z^),^)} 

The unification fails if there exists some pair (i, tt) £ lie" such that Oc'{i,^) = T. 

Many of the properties of AFSs, proven in the previous section, hold for AMRSs, too. In 
particular, if A, B are AMRSs then so is (A, J) U B if it is defined, len((A, J) U B) = len(A) and 
(A, J) U B □ A. Also, for every two AMRSs A, B, (A, {1 . . .len(A)}) U B = A iff B 1 -^n(A) □ A 

4 Parsing 

Parsing is the process of determining whether a given string belongs to the language defined by a 
given grammar, and assigning a structure to the permissible strings. Various parsing algorithms 
exist for various classes of grammars. In this section we formalize and explicate some of the notions 
of [3, chapter 13]. We give direct definitions for rules, grammars and languages, based on our new 



• n c = n A u {(i + n A , tt) I (i, tt) e n B } 
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Definition 3.3 (Abstract multi-rooted structures) A pre- abstract multi rooted struc- 
ture (pre-AMRS) is a quadruple A = (Ind, II, 0, ps), where: 

• Ind, the indices of A, is the set {I,. . .,n} for some n 

• II C Ind x Paths is a set of indexed paths, such that for each i G Ind there exists some 
it G Paths that (i, it) g II. 

• : II — ► Types is a total type-assignment function 

• psC II x II is a relation 

An abstract multi-rooted structure (AMRS) is a pre-AMRS A for which the following require- 
ments, naturally extending those of AFSs, hold: 

• II is prefix- closed: if (i, ira) G II then (i, ir) G II 

• A is fusion-closed: if (i, it a) G II and (i',7r'a') G II and (i, 7r) ps (*',""') then (i, ira') G II (as 
well as (i' , ir'a) G H), and (i, ira') ps (i' , ir'a') ( as well as (i' , ir'a) ps (i, 7ra) ) 

• p» is an equivalence relation with a finite index 

• respects the equivalence: «/(ii,7Ti) ps (12,^2) then 0(ii,7i"i) = 0(«2,""2) 

The length of an AMRS A is /en(j4) = |7n(i^|. We use A to denote the empty AMRS, too, where 
Ind\ = <f> and IIa = <f> (so that len(X) = 0). 

The closure operations CI and Eq are naturally extended to AMRSs: If A is a pre-AMRS then 
Cl(A) is the least extension of A that is prefix- and fusion-closed, and Eq(A) is the least extension 
of A to a pre-AMRS in which ps is an equivalence relation. In addition, Ty((Ind, II, 0, ps}) = 
(Ind, II, 0', ps) where 0'(i, 7r) = LJ^-, ^r^^ w j 0(«', tt'). The partial order ^ is extended to AMRSs: 
(Ind A , U A , Q A , Ki A ) -< (Ind B , n B , Q B , ~s) iff ind a = 7nrf_B,II^ C II B ,»^C»i B and for every 
(i, 7r) G 11^4, 0^4 (i, 7r) C 0s(i, 7r). In the rest of this paper we overload the symbol 'C' so that it 
denotes subsumption of AMRSs as well as MRSs. 

AMRSs, too, can be related to concrete ones in a natural way: If a = (Q,G) is a MRS then 
Abs(a) = (Ind a , n<j, CT , ps<j) is defined by: 

. Ind a = {1,...,|Q|} 

. Il a = {(i,ir) 18(^^)1} 

. e <7 (i,w) = 6(6(q i ,w)) 

• (i, 7Ti) « CT (j, 7r 2 ) iff 8(qi,iri) = 6{qj,ir 2 ) 

It is easy to see that Abs(a) is an AMRS. In particular, notice that for every i G Ind a there exists 
a path 7r such that (i, 7r) G n<j since for every i, 6(qi, e) {. The reverse operation, Cone, can be 
defined in a similar manner. 

AMRSs are used to represent ordered collections of AFSs. However, due to the possibility of 
value sharing among the constituents of AMRSs, they are not sequences in the mathematical sense, 
and the notion of sub-structure has to be defined in order to relate them to AFSs. 

Definition 3.4 (Sub-structures) let A = (Ind A , H A , Q A , Ki A ); let Inds be a finite (contiguous) 
subset of Ind A ; let n + l be the index of the first element of IndB- The sub-structure of A induced 
by Inds is an AMRS B = (Inds, T1b,Qb,~b) such that: 

• (i — n, 7r) G lis iff i G Inds and (i, 7r) G A 

• 0s(* — n, it) = Q A (i, 7r) if i G IndB 

• (ii - n, 7Ti) &B (12 ~ n, 7r 2 ) iff ii G Ind B ,%2 G Ind B and , tti) &a (h, ^2) 
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Definition 3.1 (Multi-rooted structures) A multi-rooted feature structure (MRS) is a 
pair (Q,G) where G = (Q,6) is a finite, directed, labeled graph consisting of a set Q C Nodes of 
nodes and a partial function S : Q x Feats — ► Q specifying the arcs, and where Q is an ordered, 
(repetition-free) list of distinguished nodes in Q called roots. G is not necessarily connected, but 
the union of all the nodes reachable from all the roots in Q is required to yield exactly Q. The 
length of a MRS is the number of its roots, \Q\. X denotes the empty MRS, where Q = <f>. 

Meta-variables a,p range over MRSs, and S,Q and Q over their constituents. If (Q,G) is a 
MRS and qi is a root in Q then qi naturally induces a feature structure Pr(Q,i) = (Qi, qi, Si), 
where Qi is the set of nodes reachable from qi and Si = 6\q { . 

One can view a MRS (Q, G) as an ordered sequence {A\, . . . , A n ) of (not necessarily disjoint) 
feature structures, where Ai = Pr(Q,i) for 1 < i < n. Note that such an ordered list of feature 
structures is not a sequence in the mathematical sense: removing an element from the list may 
effect the other elements (due to reentrancy among elements). Nevertheless, we can think of a MRS 
as a sequence where a subsequence is obtained by taking a subsequence of the roots and considering 
only the feature structures they induce. We use the two views interchangeably. Figure 3 depicts 
a MRS and its view as a sequence of feature structures. The shaded nodes (ordered from left to 
right) constitute Q. 




Figure 3: A graph- and AVM- representation of a MRS 

Subsumption is extended to MRSs as follows: 

Definition 3.2 (Subsumption of multi-rooted structures) A MRS a = (Q,G) subsumes 
a MRS a' = (Q',G'} (denoted by a El a') if \Q\ = \Q'\ and there exists a total function h : Q — ► Q' 
such that: 

• for every root qi £ Q, h(qi) = q\ 

• for every q £ Q, 9(q) C 6' (h(q)) 

• for every q £ Q and f £ Feats, if S(q, /)]. then h(S(q, /)) = S'(h(q), /) 

We define abstract multi-rooted structures in an analog way to abstract feature structures. 
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Proof: as above. 

The result of a unification can differ from any of its arguments in three ways: paths that were 
not present can be added; the types of nodes can become more specific; and reentrancies can be 
added, that is, the number of equivalence classes of paths can decrease. Consequently, the result 
of a unification is always more specific than any of its arguments. 

Theorem 2.32 If C = AU B then A < C . 

Proof: He = 11^ LUlg and hence 11^ C He . ~c = ~A U ~_b and hence &aQ~c- If tt 6 11^ then 
©c(7r) = (tt) or 0c (tt) = 0^ (tt) U ©5(71"), and in any case 0^(7r) C 0c (71"). CI and Eq cannot 
remove paths or equivalences and Ty only makes types more specific, and therefore A ^ C" . 

Theorem 2.33 AU B = A iff B < A. 

Proof: Suppose B < A. Then Hb C H A , &bQ~a and for every 7r £ Hb,Qb(^) Q 0j(?t). 
A U B = Ty(Eq(Cl(C))) where C = (n c , 6c, ~c) and 

• n c = n A un B = n 4 

{0A(7r)U0 B (7r) if 7T £ U A and tt £ H b 
0a(tt) if tt £ H A only = a (tt) 

Ob(t) if 7r G lis only 

• «c=~a U & B =& A 

Hence A = C and therefore A U B = A. 

Suppose A U B = A and assume toward a contradiction that B ^ A. Then at least one of the 
following cases holds: 

• Hb ^ H-A- Then there exists 7r £ Hb U 11^ that 7r ^ 11^ and hence AU B ^ A. 

• There exists some 7r such that ©5(71") ^ 6^(71"). Then 0^(7r) U Ob (ft) ^ 0j(t) and hence 

• Ri^^Ri^. Then there exist ft\,ft2 such that (7Ti &b ""2) but not (7Ti Ki A ""2)- Hence (pa^ 
U k, b ) ±k, a and AU5^A. 

TFSs (and therefore AFSs) can be seen as a generalization of first-order terms (FOTs) (see [1]). 
Accordingly, AFS unification resembles FOT unification; however, the notion of substitution that 
is central to the definition of FOT unification is missing here, and as far as we know, no analog to 
substitutions in the domain of feature structures was ever presented. 

3 Multi-rooted Structures 

To be able to represent complex linguistic information, such as phrase structure, the notion of 
feature structures has to be extended. HPSG does so by introducing special features, such as 
DTRS (daughters), to encode trees in TFSs. This solution requires a declaration of the special 
features, along with their intended meaning; such a declaration is missing in [14]. An alternative 
technique is employed by Shieber ([16]): a denumerable set of special features, namely 0, 1, . . ., 
are added to encode the order of daughters in a tree. In a typed system such as ours, this method 
would necessitate the addition of special types as well; in general, no bound can be placed on the 
number of features and types necessary to state rules. 

As a more coherent, mathematically elegant solution, we define the notion of multi-rooted 
structures that naturally extend TFSs. These structures provide a means to represent phrasal 
signs and grammar rules. They are used implicitly in the computational linguistics literature, but 
to the best of our knowledge no explicit, formal theory of these structures and their properties was 
formulated before. 
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Definition 2.26 (Unification) The unification AU B of two AFSs A = (ILa, 0^, ps^} and B = 
(IIb,Ob,~b) is the AFS C = Ty(Eq(Cl(C))), where: 

• c = (n c ,e c ,^c) 

• n c = n i un B 

{Qa(tt) U 6_b(7t) i/jtGIIa and ir £ n B 
6^(7r) j/jtEIIa on/i/ 

©_b(7t) if it £ lis cm/?/ 

• PS C =»^ U 

TAe unification fails if there exists a path tt £ II c" smc/j <Aa< Qc'i^) = T. 

Lemma 2.27 CI preserves prefixes: If A is a prefix-closed pre-AFS and A' = Cl(A) then A' is 
prefix-closed. 

Proof: Let f be a path in II'. If 7r £ II then every prefix of 7r is in II', since II is prefix-closed 
and CI only adds paths. Suppose that 7r £ II' \ II. Then there exist 7Ti, 7T2, «i, «2 £ Paths such 
that TTiai £ II and 7r2«2 £ II and 7Ti ps 7T2 and 7r = 7Tia2 (otherwise, 7r can be removed from II', 
in contradiction to the minimality of CI). If ir' is a prefix of ir than either ir' is a prefix of ir\, in 
which case ir' £ II since II is prefix-closed, or ir' = 7Tia' for some a' that is a prefix of a. Since II 
is prefix-closed, ir\u' £ II and iT2a' £ II. Therefore, as 7Ti ps 7T2, 7Tia' is added to II' by the closure 
operation. 

Lemma 2.28 Eq preserves prefixes and fusions: If A is a prefix- and fusion-closed pre-AFS and 
A' = Eq(A) then A' is prefix- and fusion-closed. 

Proof: Eq extends ps to an equivalence relation. Since only ps is modified, prefix-closure is trivially 
maintained. Select a pair (tti, 7^) £ ~' \ ~. Then either (1) 7T2 = tti; (2) 7T2 ~ tti; or (3) there exists 
a path 7T3 such that 7Ti ps its and its p» 7T2. Trivially, (1) and (2) preserve the closure properties. In 
the case of (3), to show that fusion-closure is maintained we have to show that if ttiUi £ II' and 
7r2«2 £ II' then Tt\at2 £ II' and Tt\at2 ~' 7T2«2- Since II = II',7riai £ II and 7r2«2 £ II. Since II is 
fusion-closed and 7T2 ~ 7T3, TTsa2 £ II and TTsa2 ~ 7T2«2- Since 7Ti ps its, Tt\at2 £ II and Tt\at2 ~ ^3(^2, 
too. ps' is an extension of ps to an equivalence relation, and thus Tt\at2 ~' ^2<^2- 

Corollary 2.29 If A and B are AFSs, then so is A U B. 

Proof: If A and B are AFSs then the pre-AFS C , defined as in 2.26, is prefix-closed (since A 
and B are). Cl(C) is prefix- and fusion-closed, as is Eq(Cl(C)) in which, additionally, ps is an 
equivalence relation. Ty(Eq(Cl(C))) is an AFS, since Ty only modifies such that it respects the 
equivalences. 

C" is the smallest AFS that contains lie and p»c- Since 11^ and II^ are prefix-closed, so is lie- 
However, lie and p»c might not be fusion-closed. This is why CI is applied to them. As a result 
of its application, new paths and equivalence classes might be added. By lemma 2.27, if a path is 
added all its prefixes are added, too, so the prefix-closure is preserved. Then, Eq extends ps to an 
equivalence relation, without harming the prefix- and fusion-closure properties (by lemma 2.28). 
Finally, Ty sees to it that respects the equivalences. 

Lemma 2.30 Unification is commutative: A U B = B U A. 

Proof: Observe that unification is defined using set union (U) and type unification (U) which are 
commutative. Therefore, the unification is commutative, too. 

Lemma 2.31 Unification is associative: (A\JB)UC = A\J(BUC). 
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AFSs can be partially ordered: (U A ,0 A ,tt A ) d: (H_b, 0_b, ~b) iff 11^ C Ub,~aQ~b and for 
every 7r G n^,0^(7r) C 0s(7r). This order corresponds to the subsumption ordering on TFSs, as 
the following theorems show. 

Theorem 2.19 A C B iff Abs(A) < Abs(B). 

Proof: Let Abs(A) = (U A , Q A , & A ), Abs(B) = (U b ,Ob,~b)- Assume that A C B, that is, a 
subsumption morphism h : Q A Qb exists. If 7r G then (from the definition of Abs(A)) 
6a(qa, t) I, that is, there exists a sequence qo, q\, . . . , q n of nodes and a sequence fi, . . . , f n of 
features such that for every i, < i < n, 6 A (qi, fi+i) = qi+i, qo = Qa an d tt = /i •••/«. 
Due to the subsumption morphism, there exists a sequence of nodes h(qo), . . . , h(q n ) such that 
^B(h(qi), fi+i) = h(qi + i) for every i, < i < n, and h(qo) = qs- Hence 7r G Wb- Moreover, since 
A C B, for every node q, 9(q) C 9(h(q)). In particular, #(</„) E $(M9n)) an d thus 0^(7r) C 0s(7r). 
Now suppose that two paths 7Ti,7T2 are reentrant in A. By the definition of subsumption, 7Ti and 
7T2 are reentrant in B, too. Therefore ps^Cpsb. 

If Abs(A) d Abs(B), construct a function h : Q A — ► Qb such that h(q A ) = </s and for every 
q e QA,h(6 A (q, f)) = 8 B (h(q),f). Trivially, /i is total and h(q A ) = q B . Also, if 6 A (q,f)[ then 
h(8 A (q, /)) = 8B(h(q), /). To show that C 6(h(q)) for every </, consider a path 7r leading 

from (^4 to </. Since j4&s(j4) ^ j4&s(5), 0^(7r) C 0s(7r) and hence #(<?) C 6(h(q)). Hence fc is a 
subsumption morphism. 

Theorem 2.20 For every A G Conc(A'),B G Conc(5'),AC B iff A < B' . 

Proof: Select some A G Conc( J 4'),5 G Conc(B'). IfiCfl then, by theorem 2.19, A6s(A) ^ 
A6s(5). By the definition of Cone, Abs(A) = A' and Abs(B) = B' , so that A -<B' . 
Xi A -< B' , construct a function h : Q A — ► Qb as follows: First, let h(q A ) = qs- Then, perform 
a depth-first search on the graph A and for every node q' = 6 A (q, f) encountered, if h(q')] set 
h(q') = bB{h(q), /). The order of the search is irrelevant: since A ■< -B',~a'C~B' and therefore 
if 7Ti Ki A t 7T2 then 7Ti KiB' ""2- Since A' ^ B' , XX A i C n^/ and hence bsih^q), f) is defined whenever 
6 A (q, f) is defined. Hence h is total and h(q A ) = For every node q G Qa, some path 7r exists 
that leads from q A to </ and from qs to /i(</). 0^(7r) C 0s(7r), and therefore #((/) C 9(h(q)). Hence 
/i is a subsumption morphism. 

Corollary 2.21 A ~ 5 tff A6s(A) = A6s(5). 

Proof: Immediate from theorem 2.19. 

Corollary 2.22 Conc(A) ~ Conc(B) iff A = B. 

Proof: Immediate from theorem 2.20. 

2.4 Unification 

As there exists a one to one correspondence between abstract feature structures and (alphabetic 
variants of) concrete ones, we define unification directly over AFSs. This leads to a simpler 
definition that captures the essence of the operation better than the traditional definition. We use 
the term 'unification' to refer to both the operation and its result. 

Lemma 2.23 If A = (XX A , Q A , o, pre-AFS then there exists a pre-AFS B = (XXb,Qb,~b) 

such that B is the least extension of A to a fusion-closed structure and Qb(^) = 0A(f) for every 

7T G ll A . 

Lemma 2.24 If A = (IL A , Q A , o- pre-AFS then there exists a pre-AFS B = (IT-b,Qb,~b} 

such that U A = Ub,Q A = 0_b and &b is the least extension of Ki A to an equivalence relation. 

Definition 2.25 (Closure operations) let CI be a fusion- closure operation on pre-AFSs: 
Cl(A) = A , where A is the least extension of A to a fusion-closed structure. let Eq((U, 0, ps)) = 
(n,0,ps')) where ps' is the least extension of ps to an equivalence relation. let Ty((U, 0, ?»)) = 
(n,0',«) «Aere0'(7r) = U^e(7r). 
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For the reverse direction, consider an AFS A = (II,0,ps). First construct a 'pseudo-TFS', 
Conc(A) = (Q,q,8), that differs from a TFS only in that its nodes are not drawn from the set 
Nodes. Let Q = {q[ w ] \ [it] £ [~]}, making use of the fact that 'rs' is of finite index. Let 
9(q[ w ]) = 0(-7r) for every node - since A is an AFS, respects the equivalence and therefore 6 
is representative-independent. Let q = and S^q^, f) = g 1 ^/] for every node and feature 
/. Since A is fusion-closed, 5 is representative-independent. By injecting Q into Nodes, making 
use of the richness of Nodes, a concrete TFS Conc(A) is obtained, representing the equivalence 
class of alphabetic variants that can be obtained that way. We abuse the notation Conc(A) in the 
sequel to refer to this set of alphabetic variants. Figure 2 depicts an example feature structure, 
represented both as an AVM and as a graph, along with its abstraction. 
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Figure 2: A feature structure 



Theorem 2.18 If A' e Conc(A) then Abs(A') = A. 

Proof: Let A = {U A , A , &a), A' = (Q, q, 6), Abs(A') = (n,0,«). If A' £ Conc(A) then Q can 
be mapped by a one-to-one function to the set of equivalence classes of th^ an d 6 determines the 
paths in n^. By the definition of Abs, n = n^. Given a path 7r £ n^,0(7r) = 0(b(q,Tt)) = 
#((/[„-]) = Qa(tt). If 7Ti KiA ""2 then 8(q, 7Ti) = 8(q, ^2) (since A is fusion-closed) and hence 7Ti ps 7T2. 
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2.3 Abstract Feature Structures 

The essential properties of a feature structure, excluding the identities of its nodes, can be captured 
by three components: the set of paths, the type that is assigned to every path, and the sets of paths 
that lead to the same node. In this section we elaborate on ideas presented in [12]; in contrast 
to the approach pursued in [3], we first define abstract feature structures and then show their 
relation to concrete ones. The representation of graphs as sets of paths is inspired by works on the 
semantics of concurrent programming languages, and the notion of fusion-closure is due to [4]. 

Definition 2.15 (Alphabetic variants) Two feature structures A and B are alphabetic vari- 
ants (A~ B) iff A\Z B and B \Z A. 

Alphabetic variants have exactly the same structure, and corresponding nodes have the same types. 
Only the identities of the nodes distinguish them. 

Definition 2.16 (Abstract feature structures) A pre- abstract feature structure (pre- 
AFS) is a triple (n,0,ps), where 

• II C Paths is a non-empty set of paths 

• : II — ► Types is a total function, assigning a type to every path 

• psC II x II is a relation specifying reentrancy 

An abstract feature structure (AFS) is a pre-AFS A for which the following requirements hold: 

• II is prefix- closed: if it a £ II then tt £ II (where ir, a £ Paths ) 

• A is fusion-closed: if it a £ II and ir'a' £ II and tt ps ir' then ira' £ II (as well as ir'a E H) 
and ira' ps ir'a' (as well as ir'a ps it a) 

• p» is an equivalence relation with a finite index (with [ps] the set of its equivalence classes) 

• respects the equivalence: if it\ ~ 7T2 then Q(tti) = ©(7^) 

Abstract features structures can be related to concrete ones in a natural way: If A = (Q, q, 8) 
is a TFS then Abs(A) = (ILa, 0^, ps^} is defined by: 

. n A = {7r|%-,7r)j} 

. e A (*) = 0(6(q,*)) 

• 7Tl K A 7T 2 iff 6(q, 7Tl) = 6(q, 7T 2 ) 

Lemma 2.17 If A is a feature structure then Abs(A) is an abstract feature structure. 
Proof: 

1. II is prefix-closed: LT = {tt | 8(q, tt)1}. If 7ra £ II then b(q,i:a){ and by the definition of 6, 
S(q, tt)|, too. 

2. Abs(A) is fusion-closed: Suppose that 7ra £ II, ir'a' £ II and 7r ps 7r'. Then 8(q, ir) = 8(q, ir'). 
Hence b(q,i:a r ){ (therefore ira' £ II), and 6(q,ira') = 6 (ir'a 1 ), therefore ira' ps ir'a'. In the 
same way, ir'a £ II and ir'a ps ir'a' . 

3. p» is an equivalence relation with a finite index: 7Ti ps 7T2 iff 8(q, 7Ti) = 8(q, 7^), namely iff 7Ti 
and 7T2 lead to the same node (from q) in A. Hence ps is an equivalence relation and since Q 
is finite, ps has a finite index. 

4. respects the equivalence: 0(7r) = 0(b(q,Tt)) and if 7Ti ps 7T2 then <5(g, 7Ti) = 8(q, 7^), hence 

6(7Ti) = 0(7T 2 ). 
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Figure 1: An infinite decreasing sequence of TFSs 



By lemma 2.10, rank is well defined for acyclic TFSs. A(A) can be thought of as the number of 
reentrancies in A, or the number of different paths that lead to the same node in A. For every 
acyclic TFS A, A(A) > and hence rank(A) > 0. 

Lemma 2.12 If A \Z B and both are acyclic then rank(A) < rank(B). 

Proof: Assume A \Z B and both are acyclic; hence by lemma 2.7, 11(A) C 11(5) and by lemma 2.10 
both are finite. Let h : Qa —> Qb be a subsumption morphism. 

1. If n(A) = n(5) then 1 11(A) | = |II(5)|. A C B, hence either there exists a node q £ Qa 
that 6(q) C 0(h(q)), and hence 6(A) < 6(5) (while A(A) < A(5)); or (by lemma 2.8) there 
exist two paths 7Ti,7T2 that 6a{<1a, ""i) 7^ ^a{<1a,^2), but 6b{<1b, ""i) = ^b{<1b,^2), in which 
case A(A) < A(5) (while 0(A) < 0(5)). In any case, ranfc(A) < rank(B). 

2. If n(A) C n(5) then 1 11(A) | < |n(5)|; as above, 0(A) < 0(5). However, it might be the case 
that \Qa \ < But for every node q <EQb that is not the image of any node in Qa, there 
exists a path 7r such that 6(qb, = q and 7r ^ 11(A). Hence 1 11(A) | — \Qa\ < |H(5)| — |Q_b|, 
and rank(A) < rank(B). 

Theorem 2A3 Subsumption of TFSs is not well-founded. 

Proof: Consider the infinite sequence of TFSs Ao, A\, . . . depicted graphically in figure 1. For 
every i > 0, A 8 - □ Ai + \. to see that consider the morphism h that maps qi+i to and 8i + i(q, /) 
to 8i(h(q), /) (i.e., the first i + 1 nodes of A;+i are mapped to the first i + 1 nodes of A;, and the 
additional node of A;+i is mapped to the last node of A;). Thus there exists a decreasing infinite 
sequence of cyclic TFSs and subsumption is not well-founded. 

Theorem 2A4 Subsumption of acyclic TFSs is well-founded. 

Proof: For every acyclic TFS A, rank(A) is finite and rank(A) > 0. By lemma 2.12, if A C 5 
then rank(A) < rank(B). If an infinite decreasing sequence of acyclic TFSs existed, rank would 
have mapped them to an infinite decreasing subsequence of IN , which is a contradiction. Hence 
subsumption is well-founded. 
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as nodes with no outgoing edges. For a discussion regarding the implications of such an approach, 
refer to [3, Chapter 8]. 

Definition 2.3 (Paths) A path is a finite sequence of feature names, and the set Paths = 
Feats* is the collection of paths. We use ir,a (with or without subscripts) to refer to paths, e is 
the empty path. The definition of 8 is extended to paths in the natural way: 

S(q,e) = q 

6(q,fw) = 6(6(q,f),w) 

The paths of a feature structure A are 11(^4) = {tt \ tt £ Paths and 6(qa, ^i}- 

Definition 2.4 (Cycles) A feature structure A = (Q,q,6) is cyclic if there exist a non-empty 
path a £ Paths and a node q £ Q such that 8(q, a) = q. It is acyclic otherwise. 

Definition 2.5 (Reentrancy) A feature structure A is reentrant iff there exist two different 
paths 7Ti,7T2 such that 8(q, ttx) = 8(q, 7^). In this case the two paths are said to share the same 
value. 

2.2 Subsumption 

Definition 2.6 (Subsumption) Ai = (Qx, qx, 8\) subsumes A^ = (Q2, §2, ^2) (denoted by A\ C 
A2) iff there exists a total function h : Qx — ► Q2, called a subsumption morphism, such that 

• M<Zi) = 92 

• for every q £ Q\, 9(q) C 9(h(q)) 

• for every q £ Q\ and for every f such that 8\{q, f)[, h(8i(q, /)) = 82(h(q), /) 
Ax C A 2 iff Ax C A 2 and Ax ^ A 2 . 

h associates with every node in Qx a node in Q2 with at least as specific a type; moreover, if an 
arc labeled / connects q with q' , then such an arc connects h(q) with h(q'). If A C B then every 
path defined in A is defined in B, and if two paths are reentrant in A they are reentrant in B. 

Lemma 2.7 If A Q B then 11(A) C 11(B). 

Lemma 2.8 If A\Z B then for every ttx,^2 £ H(j4), if 6a(qa, ""1) = ^a(qa, ""2) then 6b(<1b, ""1) = 
MSB, ^2). 

Definition 2.9 A partial order y on D is well-founded iff there exists no infinite decreasing 
sequence do y c?i y c?2 y ■ ■ ■ of elements of D. 

We prove below that subsumption of TFSs is well-founded iff they are acyclic. 
Lemma 2.10 A TFS A is acyclic iff H(A) is finite. 

Proof: If A is cyclic, there exist a node q £ Q and a non-empty path a that 8(q,a) = q. Let 
7r = 8(q,q), then the infinite set of paths {ira* \ 1 : > 0} is contained in II(j4). If II(j4) is infinite 
then since Q is finite, there exists a node q £ Q that 8(q, 7r 8 ')|. for an infinite number of different 
paths 7T 8 '. Since Feats is finite, the out-degree of every node in Q is finite; hence q must be part 
of a cycle. 

Definition 2.11 (Rank) let r : Types IN be a total function such that r(t) < r(t') if t C t' . 
For an acyclic TFS A, let A(A) = \T1(A)\ - \Q A \ and let 0(A) = E^ G n(A) r ( e ( 6 ( r F "")))• Define a 
rank for acyclic TFSs: rank(A) = A(A) + Q(A). 
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thus endowed with an operational semantics. Next, we prove that both semantics coincide. Finally, 
we discuss the class of grammars for which computations terminate. We give a more relaxed defini- 
tion for off-line parsability and prove that termination is guaranteed for off-line parsable grammars. 
The appendix contains a few examples of grammars and parsing. 
The main contributions of this paper are: 

• Formalization and explication of the notion of multi-rooted feature structures that are used 
implicitly in the computational linguistics literature; 

• Concise definitions of a TFS-based linguistic formalism, based on abstract MRSs; 

• Algebraic specification of a parsing step operator, Tg, w , that induces algebraic semantics for 
this formalism; 

• Treatment of parsing as a model for computation, assigning operational semantics to the 
linguistic formalism; 

• Specification and correctness proofs for parsing in this framework; 

• A new definition for off-line parsability, less strict than the existing one, and termination 
proof for off-line parsable grammars. 

2 Theory of Feature Structures 
2.1 Feature Structures 

The first part of this section summarizes some preliminary notions along the lines of [3]. For the 
following discussion we fix non-empty, finite, disjoint sets Types and Feats of types and feature 
names, respectively. We assume that the set Feats is totally ordered. We also fix an infinite set 
Nodes of nodes, disjoint of Types and Feats, each member of which is decorated by a type from 
Types through a fixed typing function 9 : Nodes — ► Types. The set Nodes is 'rich' in the sense 
that for every t £ Types, the set {q £ Nodes | 6(q) = t} is infinite. 

Below, the metavariable T ranges over subsets of types, t - over types, / - over features and q 
- over nodes. When dealing with partial functions the notation c F(x) J,' means that F is defined 
for the value x and the symbol 'f means undefinedness. Whenever the result of an application of 
a partial function is used as an operand, it is meant that the function is defined for its arguments. 

Definition 2.1 (Type hierarchy) A partial order C over Types x Types is a type hierarchy 
(or inheritance hierarchy) if it is bounded complete, i.e., if every up-bounded subset T o/Types 
has a (unique) least upper bound, UT, referred to as the unification of the types in T. 

If t\ C t'2 we say that t\ subsumes, or is more general than, t^; t'l is a subtype of (more 
specific than) t\. 

Let _L = U<f> be the most general type. Let the most specific type be T = UTypes. If UT = T 
we say that T is inconsistent. 

Definition 2.2 (Feature structures) A feature structure is a directed, connected, labeled 
graph consisting of a finite, nonempty set of nodes Q C Nodes, a root q £ Q, and a partial 
function 8 : Q x Feats — ► Q specifying the arcs such that every node q £ Q is accessible from q. 

The nodes of a feature structure are thus labeled by types while the arcs are labeled by feature 
names. The root q is a distinguished node from which all other nodes are reachable. A feature 
structure is of type t when 9(q) = t. When we say that a feature structure A exists we mean that 
no node of A is typed T. 

We use upper-case letters (with or without tags, subscripts etc.) to refer to feature structures. 
We use Q, q, 8 (with the same tags or subscripts) to refer to constituents of feature structures. 

Note that all feature structures are, by definition, graphs. Some grammatical formalisms used 
to have a special kind of feature structures, namely atoms; atoms are represented in our framework 
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Abstract 

In this paper we provide for parsing with respect to grammars expressed in a general TFS- 
based formalism, a restriction of ALE ([2]). Our motivation being the design of an abstract 
(WAM-like) machine for the formalism ([19]), we consider parsing as a computational process 
and use it as an operational semantics to guide the design of the control structures for the 
abstract machine. 

We emphasize the notion of abstract typed feature structures (AFSs) that encode the 
essential information of TFSs and define unification over AFSs rather than over TFSs. We then 
introduce an explicit construct of multi-rooted feature structures (MRSs) that naturally 
extend TFSs and use them to represent phrasal signs as well as grammar rules. We also 
employ abstractions of MRSs and give the mathematical foundations needed for manipulating 
them. We formally define grammars and the languages they generate, and then describe a 
model for computation that corresponds to bottom-up chart parsing: grammars written in the 
TFS-based formalism are executed by the parser. We show that the computation is correct 
with respect to the independent definition. Finally, we discuss the class of grammars for which 
computations terminate and prove that termination can be guaranteed for off-line parsable 
grammars. 

1 Introduction 

Typed feature structures (TFSs) serve as a means for the specification of linguistic information in 
current linguistic formalisms such as HPSG ([14]) or Categorial Grammar ([8]). They are used for 
representing lexical items, phrases and rules. Usually, no mechanism for manipulating TFSs (e.g., 
parsing or generation algorithms) is inherent to the formalism. Current approaches to processing 
HPSG grammars either translate the grammar to Prolog (e.g., [2, 5, 6]) or specify it as a general 
constraint system ([21]). 

In this paper we provide for parsing with respect to grammars expressed in a general TFS-based 
formalism, a restriction of ALE ([2]). Our motivation is the design of an abstract (WAM-like) 
machine for the formalism ([19]); we consider parsing as a computational process and use it as an 
operational semantics to guide the design of the control structures for the abstract machine. In 
this paper the machine is not discussed further. 

Section 2 outlines the theory of TFSs of [1, 3]. We emphasize abstract typed feature struc- 
tures (AFSs) that encode the essential information of TFSs and extend unification to AFSs. 
Section 3 introduces an explicit construct of multi-rooted feature structures (MRSs) that 
naturally extend TFSs, used to represent phrasal signs as well as grammar rules. Abstraction is 
extended to MRSs and the mathematical foundations needed for manipulating them is given. The 
concepts of grammars and the languages they generate are formally defined, and the TFS-based 
formalism is thus acquired a denotational semantics. In section 4 a model for computation, corre- 
sponding to bottom-up chart parsing for the formalism, is presented. The TFS-based formalism is 

*A preliminary version of this paper appeared as [20]. 
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